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Traditional particle identification methods are time consuming, experience-dependent, and poor repeatability 
challenges in heavy-ion collisions at low and intermediate energies. Researchers urgently need solutions to the 
dilemma of traditional particle identification methods. This study explores the possibility of applying intelli- 
gent learning algorithms to the particle identification of heavy-ion collisions at low and intermediate energies. 
Multiple intelligence algorithms, including XgBoost and TabNet, were selected to test datasets from the neutron 
ion multi-detector for reaction-oriented dynamics (NIMROD-ISiS) and Geant4 simulation. Machine learning 
algorithms based on tree structures and deep learning algorithms e.g. TabNet show excellent performance and 
generalization ability. Adding additional data features besides energy deposition can improve the algorithm’s 
identification ability when the data distribution is nonuniform. Intelligent learning algorithms can be applied to 


solve the particle identification problem in heavy-ion collisions at low and intermediate energies. 


Keywords: Heavy-ion collisions at low and intermediate energies, Machine learning, Ensemble learning algorithm, Particle 


identification, Data imbalance 


I. INTRODUCTION 


Intelligent algorithms play crucial roles in nuclear physics. Challenges in nuclear physics experiments include high complex- 
ity, extensive data, time-consuming experiments, and intricate models. Taking particle collision experiments as an example, 
millions of terabytes of data are generated daily for heavy-ion collisions at high energies. Therefore, the extraction of useful 
information from complex experimental data has become an enormous challenge. 

Large-scale experiments such as ATLAS, ALICE, and CMS have already applied machine-learning and deep-learning algo- 
rithms [1—4] to analyze and process experimental data. Typical examples include research on the particle-track reconstruction 
problem [5-8] in high-energy physics experiments, data analysis, and pattern recognition of the Higgs boson [9-13]. The appli- 
cation of machine learning in particle physics can be seen in a large-scale dynamic review [https://iml-wg. github.io/HEPML- 
LivingReview/] and the website opened by the ML Physics Portal [14-17]. 

Currently, research on intelligence algorithms in nuclear physics experiments [18—21] focuses on data analysis, such as the 
masses of atomic nuclei [22—26], nuclear charge radii [27-31], decay half-lives [32-37], critical reaction thresholds [38], and 
spallation reaction cross-sections [39], etc. In addition to using machine learning algorithms to investigate various physical 


issues [40-42], researchers have used these algorithms to analyze experimental data [43-45]. This involves tasks, such as 


* This work is supported by the Strategic Priority Research Program of Chinese Academy of Sciences (No. XDB34030000), the National Key Research 
and Development Program of China (No. 2022 YFA1602404), the National Natural Science Foundation (No. 11975210, No. U1832129), and the Youth 


Innovation Promotion Association CAS (No. 2017309). 
+ Corresponding author, Qian-Min Su, suqm @sues.edu.cn 


t Corresponding author, Xi-Guang Cao, caoxg @sari.ac.cn 


1s particle trajectory reconstruction, vertex reconstruction [46], and particle identification in nuclear reactions. Advancements in 
16 experimental equipment and related technologies have facilitated the integration of machine learning and nuclear physics. 

17 Current research on particle identification focuses on high-energy particle physics. To date, research on particle identification 
18 has mainly focused on identifying particle types [47] and separating rare particles from background signals. The data and algo- 
19 rithms used for particle identification depend on the type of detector. For example, the output data of a calorimeter detector can 
20 be processed and converted into matrix data; therefore, image algorithms, such as CNN and GNN, can be used for processing. 
21 The research and applications of machine learning in particle identification have mainly focused on LHC detectors, such as 
22 calorimeters [48-50] and Cherenkov detectors [51]. Moreover, a new research focus in recent years on LHC experiments has 
23 been the development of new detector software and hardware based on machine-learning and deep-learning algorithms [52]. 

24 Compared with other nuclear reactions, the particles generated in heavy-ion collisions at low and intermediate energies are 
235 Of various types and have complex energy distributions. Numerous fragments have similar charges and masses. Experiments 
æ on heavy-ion collisions depend on the energy resolution of the detector and require a detection array with large solid-angle 
27 coverage. Therefore, the identification of dozens or even hundreds of reaction products from independent detection units 
2 is challenging. Traditional particle identification methods include telescope [53], time-of-flight [54], magnetic spectrometer, 
29 Bragg spectroscopy, and pulse shape discrimination methods. These methods are often combined to improve identification 
30 ability, especially for heavy fragments with minor differences in charge and mass numbers between adjacent fragments. The 
31 performance of the traditional methods for heavier particles is hindered by their dependence on experience, poor repeatability, 
32 and time consumption. The precise identification of charge and mass numbers is fundamental to all research related to heavy-ion 
33 collisions, and is a very powerful method for studying exotic nuclear configurations [55-60]. Compared with particle identifi- 
34 cation in particle physics, the wide variety and slight differences in the charge and mass numbers of charged particles produced 
35 in heavy-ion reactions pose significant challenges for existing particle identification methods. Therefore, the development of 
36 a universal, efficient, and high-precision particle identification method based on machine learning techniques will significantly 
37 boost the study of heavy-ion collisions. 

3 Parker et al. [61] devised a 5-layer neural network and evaluated its performance on the 22nd and 23rd detectors of a neutron 
3 ion multi-detector for reaction-oriented dynamics (NIMROD-ISiS). We also used a dataset from NIMROD-ISiS detector array. 
4o This study aimed to identify the particle charge and mass numbers in heavy-ion collisions at low and intermediate energies. 
a Supervised learning algorithms were used to train particle identification models based on AE-E energy deposits from telescope 
42 (or super-telescope) detectors in heavy-ion collisions. Machine learning and deep learning algorithms were applied to identify 


43 the particles’ charge and mass numbers, and their capabilities were compared. 


44 Il. DATASET AND METHODS 


4s Real-world data (RWD) come from experiments on heavy-ion collisions at low and intermediate energies carried out at the 
4s Cyclotron Institute of Texas A&M University and consist of reaction products detected by the NIMROD-ISiS array [62, 63]. 
47 The NIMROD-ISiS detector array comprised 14 rings. Experimental data were obtained from 143 detectors, including 124 
4s telescope detectors and 19 super telescope detectors with ring numbers ranging from 2 to 15. The detection system included Si 
a detectors and CsI (TI) scintillators with angles ranging from 3.6° to 167.0°. The back half of the NIMROD (90.0° - 167.0°) 
so consists of half the Indiana Silicon Sphere. Si detectors were combined with the CsI detectors as ’telescopes,’ while some 


st were equipped with two Si detectors in tandem, known as ’super telescopes’ (3.6°-45°), enhancing the ability to identify mass 


s2 numbers of heavier fragments. The capacity to include ionization chambers in front of Si detectors is also available. Fig. 1 


s3 Shows the structure of the NIMROD-ISIS. 


Fig. 1. Schematic diagram of the NIMROD-ISiS detector array layout, from Texas A&M NIMROD-ISiS official website 
(https://cyclotron.tamu.edu/nimrod/) 


s4 In addition to the dataset from Texas A&M University, Geant4 [64] was used to simulate heavy-ion collisions at intermediate 
ss energies. QMD model with G4IonQMDPhysics was used as an event generator to simulate the reaction process of a beam inci- 
se dent on a target. The detection processes in Geant4 include electromagnetic interactions (G4EmStandardPhysics), energy trans- 
s7 fer and loss (G4EmExtraPhysics and G4StoppingPhysics), decay processes (G4DecayPhysics and G4RadioactiveDecayPhysics), 
ss and elastic and inelastic scattering (G4HadronHElasticPhysics, G4HadronPhysicsINCLXX, and G4lonElasticPhysics). The 
so simulations involved collisions of 7°Si with an energy of 50 MeV/u and 1?C particles in vacuum. The detector system consisted 
eo Of four supertelescope detectors. The simulation generated a dataset with more than four million particles. Fig. 2 shows the 


et structure of the detector system and the AE-E two-dimensional histogram. 
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Fig. 2. The structure of the super telescope detector used in the Geant4 simulation and the AE-E two-dimensional histogram from Geant4 
simulation. 


62 This study covers serval common machine learning algorithms, such as Support Vector Machines (SVM), Logistic Regression 
e (LR), and Bayesian classifiers. Ensemble learning algorithms based on tree structures and TabNet, a deep-learning algorithm, 
e4 were also used. The algorithms used in this study are briefly described below. 

65 (a) MLP 

es Multi-layer perceptron (MLP) is a feed-forward neural network composed of multiple neurons, which is the basis and proto- 
67 type of many artificial and deep learning neural networks. 


es (b) Random Forest 


e Random forest is an early tree-based ensemble learning algorithm with multiple decision trees [65]. This offers advantages 
70 of both decision trees and ensemble learning. Strong robustness and predictive ability are also advantageous. 

7 (c) XgBoost 

72 XgBoost is a tree-based ensemble learning algorithm proposed in 2016 [66] that is widely used in data mining, natural 
73 language processing, image recognition, and other fields. In general, XgBoost is a machine-learning algorithm with high 
7 efficiency, accuracy, flexibility, explainability, and scalability. 

7s (d) LightGBM 

7s  LightGBM, a tree-based gradient boosting framework for ensemble learning, has been widely used in various applications 
77 [67]. Built on the gradient-boosting decision tree (GBDT) algorithm, LightGBM incorporates advanced techniques such as 
7s gradient-based one-sided sampling (GOSS) and histogram-based acceleration. These optimizations enabled faster training and 
79 lower memory consumption, making LightGBM an efficient and practical choice for machine learning tasks. 

so (e) CatBoost 

s1  CatBoost is a tree-based ensemble-learning algorithm developed by Yandex [68]. In terms of building a decision tree, 
s2 compared with XgBoost and LightGBM, CatBoost can automatically process the category features of the data and automatically 
s3 process the scaling of the data features without additional data processing. CatBoost adopts the same gradient-based splitting 
s4 and feature selection strategies based on a greedy algorithm as XgBoost. CatBoost also automatically handles missing values 
ss in the data without additional data padding and has a certain robustness to noise and outliers. 

ss | Boosting-based ensemble learning algorithms such as XgBoost, LightGBM, and CatBoost are widely used in various fields. 
s7 The basic process of these algorithms involves training multiple weak learners, assigning weights to training samples, and 
ss iteratively adjusting these weights based on the learner’s performance. This iterative process aims to create a powerful ensemble 
s model that is capable of accurate classification. Fig. 3 depicts the underlying structure of ensemble learning algorithms that 
əx employ the boosting method. 

a1 (f) TabNet 

ə  TabNet, which was introduced by Google in 2019, is a neural network structure explicitly designed for classification, pre- 
ə diction, and regression tasks involving tabular data [69]. Unlike traditional decision-tree-based machine learning algorithms, 
ə TabNet minimizes the need for preprocessing input data and can automatically learn the interdependencies among input fea- 
ə tures. It incorporates an attention transformer that uses an attention mechanism to select relevant feature vectors dynamically. 
æ Since its inception, TabNet has been widely adopted in various applications involving tabular data [70, 71]. 

ə Fig. 4 illustrates the procedure for applying the intelligent algorithms in this study. Training a classification model typically 
ə involves several steps. 

ə (a) Data acquisition: Obtaining a dataset containing information about particle charge and mass, which can be from experimen- 
1o0 tal or simulated data. 

101 (b) Data preprocessing: Ensuring the quality and consistency of the data through noise removal, addressing missing data, and 
102 normalizing features. 

103 (c) Data splitting: Dividing the dataset into training and testing sets. A training set was used to train the model and a test set 
1⁄4 Was employed to evaluate the trained model. Random and stratified sampling are the commonly used methods. 

105 (d) Feature engineering: Raw data is transformed, extracted, and selected to create informative and expressive feature sets. 

106 (e) Algorithm selection: Suitable algorithms are chosen based on specific task requirements. The main task of the algorithm is 


1o7 mult-classification. 
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Fig. 3. The structure of ensemble learning and boosting method. 


108 (f) Training and tuning of parameters: The algorithm’s parameters can be tuned to improve the performance of the model fur- 
109 ther. Generally, each algorithm has a unique set of parameters that can be adjusted. 


110 (g) The performance of the trained model is assessed using the testing dataset. Appropriate evaluation metrics were selected to 


111 evaluate the performance of the model for particle identification. 


“2 Based on the structure of NIMROD-ISiS, the dataset was initially split based on the ring number determined by the forward 


113 angle O e detector. ubsequently, e data were divided into two categories: telescope and supertelescope detectors. e 
gle of the detector. Subsequently, the dat divided into t g telescope and supertelescope d Th 


114 Geant4 dataset was used for training and testing with machine learning and deep learning algorithms. After identifying the 


115 Optimal algorithms, a subset of the detector data was used to evaluate the generalization ability of the algorithms. 


ne Two classification strategies are adopted in this study. 

117 (a) Using the algorithm to train and test charge and mass numbers, respectively. 

118 (b) In classifying particle mass numbers, the particle’s charge number was included as a part of the data features. From a logical 
119 perspective, this strategy is similar to traditional particle identification methods. 

120 In a practical study, the experimental data exhibited a highly unbalanced distribution. Randomly extracting data can lead to 
121 disparate data category distributions among the training, validation, and test sets. This can lead to critical code errors and poor 


122 performance. Therefore, to address this problem, stratified sampling was employed as an alternative to random sampling. 


123 Il. RESULTS AND DISCUSSION 


124 As the core task of particle identification involves multiple classifications, the use of suitable evaluation metrics for multiple 
125 Classification algorithms is crucial. Common evaluation metrics include the accuracy, recall, precision, and fl-score [72-76]. 
126 These metrics help assess the performance of the algorithm from different aspects. The results of the classification task can be 


127 Categorized into the following four types: 


N 


Fig. 4. The procedure of applying intelligent algorithms in this paper. The process is divided into three parts: datasets and algorithms, model 
training and testing, and evaluation of test results. Model training and testing is the most important part. 


128 (a) Predict positive samples as positive. (TP) 

129 (b) Predict negative samples as negative. (TN) 

130 (C) Predict negative samples as positive. (FP) 

131 (d) Predict positive samples as negative. (FN) 

132 When evaluating the algorithm, the corresponding evaluation metrics were calculated using the classification results. The 


133 equations are as follows: 


134 Accuracy =(TP+TN)/(TP+TN+FP+FN) (1) 
135 
136 Precision = (TP)/(TP+ FP) (2) 
137 
138 Recall = (TP)/(TP+ FN) (3) 
139 
140 F1 — score = 2 x Precision x Recall/(Precision + Recall) (4) 


141 Accuracy is defined as a measure of correctness. The precision measures the accuracy of a model in predicting positive 
142 examples. The recall represents the coverage of positive samples that are correctly predicted. The fl-score is a compound 


143 evaluation metric consisting of precision and recall. 


P 


144 Because the problem of positive and negative samples is extended to multiple categories in multi-classification tasks, meth- 
145 ods for computing comprehensively evaluated metrics are essential. Commonly used strategies are the micro average, macro 


148 average, and weighted average. 


D 


147 Macro-averaging calculates the average precision and recall of each class. 
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The micro-average ignores category differences and calculates the overall TP, FP, TN, and FN. 

The weighted average is similar to the macro average, but uses category proportions as weights to calculate performance 
metrics. 

In particle identification, all generated particles have equal significance. Therefore, the macro average was chosen as the 
calculation method for the evaluation metrics. The macro-average provides a balanced assessment across all classes and facili- 
tates a comprehensive understanding of model performance. Because the mass and charge determined the particle category, the 
charge and mass numbers were merged into a binary data format to calculate the evaluation metric. 

The particles detected by the NIMROD-ISiS detector array can be categorized as light ions (with proton numbers ranging 
from one to four) or heavy ions. Most heavy ions cannot penetrate the Si detector, whereas most light particles pass through 
it. The disparity in the production yield between light particles and heavy ions during the reaction process leads to imbalances 
in data distribution. The dataset was split based on whether the particles hit the CsI detector. This approach solves the data 
imbalance problem in particle analysis and improves the algorithm performance. The XgBoost ensemble-learning algorithm 
was selected for testing. The input data features were the total energy, energy deposition in the Si and CsI detectors, and the 
detector position. The charge and mass numbers of the particles were used as data labels. Fig. 5 shows the results of the model 


based on the telescope data. Table 1 shows the results of the model on the super-telescope data with two different classification 


strategies. 
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Fig. 5. Test results of XgBoost on NIMROD-ISiS telescope data. Figures (a) and (b) show the results of XgBoost on particles w/o registrations 
on CsI detector. The latter results are better than the former. 


The model performed well when tested on particles that were not registered on a CsI detector. The evaluation metrics for 
each ring generally exceeded 0.85. The model performs better in identifying charge numbers than mass numbers. It also 
achieves high accuracy for particles registered on CsI detector. However, their precision, recall, and fl-score were low. This 
discrepancy is attributed to extreme data imbalance. Fig. 6 shows the mass distribution of ring 2. The mass distribution of 
the particles registered on CsI detector was highly non-uniform. There was a significant difference between the categories with 
the highest and lowest counts. The precise identification of rare categories is challenging for this model. As the evaluation 


strategy uses a macro average, the evaluation metrics of the classifier are calculated as average values across all categories. 


CsIE 
Zero 
Zero 
Zero 
Zero 
Zero 
Zero 
Non-Zero 
Non-Zero 
Non-Zero 
Non-Zero 
Non-Zero 
Non-Zero 


Accuracy 


0.996 
0.934 
0.932 
0.997 
0.966 
0.964 
0.974 
0.892 
0.87 

0.974 
0.903 
0.881 


TABLE 1. Test results on NIMROD-ISiS SuperTelescope data. 


Precision Recall Fl-score Label 
0.996 0.957 0.969 Z 
0.874 0.844 0.856 A 
0.893 0.877 0.883 Z+A 
0.997 0.958 0.969 Z 
0.908 0.893 0.9 A 
0.924 0.916 0.919 Z+A 
0.473 0.402 0.425 Z 
0.3 0.244 0.261 A 
0.316 0.247 0.266 Zt+A 
0.468 0.401 0.425 Z 
0.326 0.274 0.289 A 
0.335 0.272 0.295 Z+A 


Strategy 
Independence 
Independence 
Independence 
FirstZ,SecondA 
FirstZ,SecondA 
FirstZ,SecondA 
Independence 
Independence 
Independence 
FirstZ,SecondA 
FirstZ,SecondA 
FirstZ,SecondA 


171 Thus, the performance of categories with small percentages significantly affected the overall evaluation metrics. The training 


172 Strategies for charge and mass numbers did not show any significant differences. Including the charge number as an additional 


173 data feature did not effectively improve the identification ability of the model for particles in deficient quantities. If the model 


174 fails to precisely predict the charge number of the particles, the accuracy of the mass number identification is also affected. 
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Fig. 6. Mass number distribution for events w/o registration on CsI detectors in ring 2. The mass number distribution of particles without 
registration on CsI detector is well balanced, with sample sizes exceeding 1000 in most categories. Among particles with registration on CsI 
detector, most heavy ions count around 100 occurrences. 


175 To address this problem, the following methods have been proposed: 


176 (a) Algorithm parameter optimization: Refining algorithm parameters (reducing the learning rate, increasing iteration numbers, 


177 expanding tree depth, etc.) to improve accuracy, precision, and recall. However, adjusting the parameters alone had a limited 


17s impact on the categories with limited samples, even when distinct weights were assigned to each category. 

179 (b) Data category adjustment: The imbalance ratio can be reduced by eliminating data categories that comprise only a few or a 
180 few dozen samples. 

181 (c) Exploration of data pre-processing methods: Trying out different approaches, including normalization, standardization, or 
182 no data pre-processing. 

183. The most effective solution to the severe shortage of samples in specific categories is to include additional data. This reduces 
184 the imbalance ratio and thus improves the accuracy. For instance, in ring 10, each category had over 20,000 samples and the 
185 imbalance ratio was only 5:1. XgBoost performed excellently, with the evaluation metrics for each type exceeding 0.9. 

1 Other factors, such as detector position and hardware conditions, such as temperature and electronic signal drift, can cause 
187 Scaling issues, thus affecting algorithm accuracy. To address this issue, Geant4 was used to simulate the experiment and detector 
188 performance, enabling a focused research to address the imbalance issue. 

189 In Geant4, the total particle energy, time of flight (ToF), kinetic energy before entering the detector, detector position, and 
190 particle deposition energy (Eabs) were selected as input data features. Testing with XgBoost demonstrated that the additional 
191 data features alleviated the data imbalance problem, resulting in excellent performance. 

12 To confirm that this is not limited to XgBoost alone, a comparison test was conducted with other machine-learning and deep- 
193 learning algorithms. The test results ( Fig. 7) confirmed the earlier findings. Tree-based machine learning algorithms, such 
194 aS XgBoost and deep learning TabNet demonstrated excellent performance, whereas traditional machine learning algorithms, 
195 Such as LR, SVM, and Bayesian classifiers, exhibited poor performance. 

19 These results validate the effectiveness of the proposed approach in mitigating data imbalances and highlight the superiority 


197 Of tree-based machine learning and deep learning algorithms in addressing this challenge. 
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Fig. 7. Test results of machine learning and deep learning algorithms on the Geant4 dataset. SVM, MNB, GNB, and LR perform poorly. MLP 
has relatively low precision and recall. Ensemble learning algorithms such as XgBoost and deep learning algorithm TabNet perform well. 


138 Subsequently, the algorithms were evaluated using only the energy deposition as the data feature. XgBoost, LightGBM, 


10 


199 CatBoost, and TabNet, which exhibited promising performances in prior tests, were selected for this assessment. The results 
200 demonstrated that each algorithm showed decreased accuracy in predicting the particle mass number. 

201 Based on these observations, a series of additional data features were selected for comparative analysis. Numerous tests 
202 have shown that particle flight time is important for improving the accuracy of the algorithm. Although this feature alone 
203 proved insufficient for charge and mass number identification, its combination with the energy deposition feature enhanced the 
204 identification capabilities of the algorithm. Detailed descriptions of the corresponding test results are presented in Table 2 and 


205 Table 3. 


TABLE 2. Classification results from Geant4 simulation data, with independent training on charge and mass numbers. 


Algorithm Accuracy Precision Recall Fl-score Feature 
XgBoost 0.127 0.045 0.079 0.05 ToF 
XgBoost 0.862 0.863 0.827 0.839 Eabs 
LightGBM 0.828 0.821 0.795 0.804 Eabs 
CatBoost 0.836 0.804 0.765 0.771 Eabs 
TabNet 0.813 0.837 0.762 0.791 Eabs 
XgBoost 0.97 0.986 0.963 0.971 Eabs,ToF 
LightGBM 0.947 0.95 0.936 0.943 Eabs,ToF 
CatBoost 0.948 0.948 0.914 0.926 Eabs,ToF 


TabNet 0.971 0.99 0.976 0.983 Eabs,ToF 


TABLE 3. Classification results from Geant4 simulation data, where charge number is one of the data features of dataset defined by mass 
number. 


Algorithm Accuracy Precision Recall Fl-score Feature 
XgBoost 0.127 0.056 0.079 0.051 ToF 
XgBoost 0.87 0.878 0.839 0.852 Eabs 
LightGBM 0.85 0.848 0.812 0.82 Eabs 
CatBoost 0.83 0.798 0.752 0.757 Eabs 
TabNet 0.828 0.854 0.794 0.821 Eabs 
XgBoost 0.971 0.987 0.965 0.972 Eabs,ToF 
LightGBM 0.952 0.949 0.943 0.945 Eabs,ToF 
CatBoost 0.948 0.947 0.906 0.918 Eabs,ToF 
TabNet 0.952 0.985 0.948 0.963 Eabs,ToF 


26 The final phase of the study involved a comprehensive investigation of the generalization ability of the algorithms. Unlike 
207 previous tests involving data from all detectors, this phase focuses on training models using a specific subset of detectors, 
208 reserving the remaining data for testing. The data features include the time-of-flight (ToF) and energy deposition. Various 
29 data preprocessing techniques, including normalization and standardization, were explored during the testing phase. Before 
210 model training, datasets from specific detectors were normalized and standardized using the MinMaxScaler and StandardScaler 
211 methods from the sklearn.preprocessing package in Python. These methods were also used to test data from other detectors 
212 before evaluating the trained model. However, these methods have a significantly negative impact on generalization ability. 
23 Therefore, no data preprocessing was performed. The results are shown in Fig. 8. 

24 The performance of the algorithms was excellent. The evaluation metrics of TabNet and XgBoost are mostly over 0.9 for 
25 all detector data (Fig. 9). These findings establish the efficacy of training models with robust generalization abilities even in 
z216 Situations with limited data availability. Overall, these results highlight the advantages and effectiveness of machine learning 
217 and deep learning algorithms, and demonstrate their potential for practical applications. 


zis Inspired by these findings, a similar study of data similarity was conducted on specific rings of NIMROD-ISiS. The input 
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Fig. 8. Generalization ability test results of the XgBoost algorithm on the Geant4 dataset using different data preprocessing methods. It can 
be noticed that both normalization and standardization severely reduce the model’s generalization ability. 
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Fig. 9. The test results of generalizability ability test of XgBoost, CatBoost, LightGBM, and TabNet. Figures (a), (b), (c), and (d), respectively, 
show their accuracy, precision, recall, and fl-score curves. The evaluation metrics of all algorithms exceed 0.8. The evaluation metrics of 
XgBoost and TabNet are mostly over 0.9. TabNet shows better generalization ability than ensemble learning algorithms. 


219 data features included total energy, energy deposition, and detector position. Through testing, it was discovered that, depending 


22 on the similarity of the data, the detectors of NIMROD-ISiS can be divided into groups. Taking the data (particles registered 


221 on CsI detector) from ring 9 as an example, ring 9 can be further divided into two groups of detectors (Fig. (a) and Fig. (b) of 


22 Fig. 10). The results depicted in Fig. 10 provide valuable information on the patterns and characteristics of the NIMROD-ISiS 


2233 detector. Moreover, they contribute to the optimization of algorithms and the improvement of data-processing methods. These 


224 findings also have significant implications for the study and enhancement of the detector array design and performance. Overall, 


223 the findings have practical implications. 
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Fig. 10. In the tests carried out on the NIMROD-ISiS ring 9, the predictions for the other detector data can be categorized into two scenarios, 
Fig. (a) and Fig. (b). A high degree of similarity can be observed between the detectors on ring 9. 


226 IV. CONCLUSION 


27 Particle identification in machine learning (ML) is an integrated problem. Researchers must consider various factors in- 
22 Cluding data selection, partitioning, feature engineering, preprocessing, algorithm selection, and parameter tuning. Traditional 
22 particle identification methods require significant manual effort and are limited by researchers’ experience and available time. 
23 Our study aims to develop a universal and adaptable particle identification model that assists in manual processes. Although 
231 achieving 100% accuracy may not be possible, ensemble learning algorithms have meaningful results, especially XgBoost. The 
232 conclusions are as follows: 

23 First, intelligent algorithms, particularly tree-based ensemble learning algorithms, can effectively identify particles in heavy- 
234 ion collisions at low and intermediate energies. This offers a viable alternative to traditional methods. 

23 Secondly, addressing data imbalances is crucial for particle identification. Severe data imbalances significantly affected the 
23 results. The solutions include ensuring sufficient data for a balanced distribution, adding additional data features beyond particle 
237 energy deposition, and constructing different identification models based on the detector structure. 

233 Third, training a specialized particle identification model using the existing data reduces the time and resources required 
239 for traditional particle identification. Laboratories conducting long-term, large-scale heavy-ion collision experiments can be 
240 beneficial. This paves the way for the development of a professional particle identification software. 

21 Finally, machine-learning algorithms can be used to study detector similarity, particularly in large-scale detector arrays with 
242 complex structures. 

243 | Combinations of supervised and unsupervised learning approaches should be explored in future studies. Other physics soft- 
244 ware such as NpTool [77] will also be used to simulate the experiments. NpTool is known for its efficient project management 
245 and simulation of various sophisticated detector arrays. 

26 Because Geant4 simulations are time consuming and resource intensive, there is a need to explore alternative approaches for 
247 generating particle collision data. Generative Adversarial Networks (GAN) [78] and Variational Autoencoders (VAE) [79] have 


248 Shown promise in generating simulated data for detectors in the field of high-energy physics [80-83]. Utilizing GAN and VAE 
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can reduce the time and resources required for massive amounts of simulated data, thereby making the process more efficient 


and accessible. 


Building on the excellent performance of TabNet, further investigations will include exploring additional deep-learning 
algorithms, such as DeepGBM [84] and GrowNet [85]. Moreover, we attempted to change the existing ensemble learning 
algorithm into a multi-output algorithm to classify the mass and charge numbers simultaneously. Our research aims to enhance 
the understanding of the detector system in sophisticated experiments, which can be used to explore interesting clustering 


phenomena in nuclei [86-89]. 


V. ACKNOWLEDGEMENTS 


The Cyclotron Institute of Texas A&M University is gratefully acknowledged for permission to use the NIMROD-ISiS data 
to train and test the machine learning algorithms. We are deeply indebted to Professor J. B. Natowitz (Texas A&M University) 


for valuable suggestions and critical review of this manuscript. 


[1] A. Kalweit, Particle identification in the ALICE experiment. J. Phys. G. Nucl. Partic. 38, 124073 (2011). 
[2] C. Zampolli, Particle identification with the ALICE detector at the LHC. (2012). 
[3] P. Križan, Particle identification at Belle II. J. INSTRUM. 9, C07018 (2014). 
[4] L.K. Graczykowski, M. Jakubowska, K.R. Deja et al., Using machine learning for particle identification in ALICE. J. INSTRUM. 17, 
C07016 (2022). 
[5] P. Calafiura, S. Farrell, H. Gray et al., TrackML: a high energy physics particle tracking challenge. IEEE 14th International Conference 
on E-Science (e-Science) 344-344 (2018). 
[6] C. Tüysüz, F. Carminati, B. Demirk6z et al., Particle track reconstruction with quantum algorithms. Epj. Web. Conf. 245, 09013 (2020). 
[7] O. Bakina, D. Baranov, I. Denisenko et al., Deep learning for track recognition in pixel and strip-based particle detectors. J. INSTRUM. 
17, P12023 (2022). 
[8] P. Goncharov, E. Schavelev, A. Nikolskaya et al., Ariadne: PyTorch library for particle track reconstruction using deep learning. AIP 
Conf. Proc. 2377, (2021). 
[9] T.Q. Chen, T. He, Higgs boson discovery with boosted trees. JMLR Work. Conf. Proc. 42, 69-80 (2015). 
[10] M. Azhari, A. Abarda, B. Ettaki et al., Higgs boson discovery using machine learning methods with PySpark. Procedia Comput. Sci. 
170, 1141-1146 (2020). 
[11] C. Adam-Bourdarios, G. Cowan, C. Germain-Renaud et al., The Higgs machine learning challenge. J. Phys.: Conf. Ser. 664, 072015 
(2015). 
[12] S.R. Ahmad, Technical report of participation in Higgs boson machine learning challenge. (2015). 
[13] A.E. Phoboo, Machine learning wins the Higgs challenge. (CERN Bulletin, 2014), https://cds.cern.ch/journal/ 
CERNBulletin/2014/49/News%20Articles/1972036. Accessed November 16, 2023. 


[14] University of California, Irvine. ML Physics Portal, http: //mlphysics.ics.uci.edu/. Accessed November 16, 2023. 

[15] M.J. Fenton, A. Shmakov, TW. Ho et al., Permutationless many-jet event reconstruction with symmetry preserving attention networks. 
Phys. Rev. D 105, 112008 (2022). 

[16] C. Shimmin, P. Sadowski, P. Baldi et al., Decorrelated jet substructure tagging using adversarial neural networks. Phys. Rev. D 96, 
074034 (2017). 


14 


285 [17] P. Baldi, P. Sadowski, D. Whiteson, Searching for exotic particles in high-energy physics with deep learning. Nat. Commun. 5, 4308 
286 (2014). 

287 [18] W.B. He, J.J. He, R. Wang et al., Machine learning applications in nuclear physics (in Chinese). Sci. Sin.-Phys. Mech. Astron. 52, 
288 252004 (2022). 

289 [19] W.B. He, Q.F. Li, Y.G. Ma et al., Machine learning in nuclear physics at low and intermediate energies. Sci. China Phys. Mech. Astron. 
290 66, 282001 (2023). 

29: [20] M. Zhou, Y.Q. Luo, H.C. Song, Applications of machine learning in relativistic heavy ion physics (in Chinese). Sci. Sin.-Phys. Mech. 
292 Astron. 52, 252002 (2022). 

293 [21] W.B. He, Y.G. Ma, L.G. Pang et al., High-energy nuclear physics meets machine learning. Nucl. Sci. Tech. 34, 88 (2023). 

204 [22] T.L. Zhao, H.F. Zhang, Neural network approach to improve the quality of atomic nuclei (in Chinese). Sci. Sin.-Phys. Mech. Astron. 52, 
295 252008 (2022). 

296 [23] M.R. Mumpower, T.M. Sprouse, A.E. Lovell et al., Physically interpretable machine learning for nuclear masses. Phys. Rev. C 106, 
297 L021301 (2022). 

293 [24] Z.M. Niu, H.Z. Liang, Nuclear mass predictions with machine learning reaching the accuracy required by r-process studies. Phys. Rev. 
299 C 106, L021303 (2022). 

300 [25] L. Neufcourt, Y.C. Cao, W. Nazarewicz et al., Bayesian approach to model-based extrapolation of nuclear observables. Phys. Rev. C 98, 
301 034318 (2018). 

302 [26] Z.P. Gao, Y.J. Wang, H.L. Lü et al., Machine learning the nuclear mass. Nucl. Sci. Tech. 32, 109 (2021). 

303 [27] J.Q. Ma, Z.H. Zhang, Improved phenomenological nuclear charge radius formulae with kernel ridge regression. Chinese Phys. C 46, 
304 074105 (2022). 

305 [28] X.X. Dong, R. An, J.X. Lu et al., Nuclear charge radii in Bayesian neural networks revisited. Phys. Lett. B 838, 137726 (2023). 

306 [29] R. Utama, W.C. Chen, J. Piekarewicz, Nuclear charge radii: density functional theory meets Bayesian neural networks. J. Phys. G. Nucl. 
307 Partic. 43, 114002 (2016). 

308 [30] S.J. Tao, L.F. Zhang, Q.Y. Zhang et al., Improved naive Bayesian probability classifier in nuclear charge radius prediction (in Chinese). 
309 Sci. Sin.-Phys. Mech. Astron. 52, 252009(2022). 

310 [31] Y.F. Ma, C. Su, J. Liu et al., Predictions of nuclear charge radii and physical interpretations based on the naive Bayesian probability 
311 classifier. Phys. Rev. C 101, 014304 (2020). 

312 [32] Z.M. Niu, H.Z. Liang, B.H. Sun et al., Predictions of nuclear -decay half-lives with machine learning and their impact on y-process 
313 nucleosynthesis. Phys. Rev. C 99, 064307 (2019). 

314 [33] J.M. Munoz, S. Akkoyun, Z.P. Reyes et al., Predicting G-decay energy with machine learning. Phys. Rev. C 107, 034308 (2023). 

315 [34] Z.Y. Yuan, D. Bai, Z.Z. Ren et al., Theoretical predictions on a-decay properties of some unknown neutron-deficient actinide nuclei 
316 using machine learning. Chinese Phys. C 46, 024101 (2022). 

317 [35] X.D. Bu, D. Wu, C.L. Bai, Prediction of a-decay half-lives for superheavy nuclei based on neural network (in Chinese). Sci. Sin.-Phys. 
318 Mech. Astron. 52, 252005 (2022). 

319 [36] P. Li, J.H. Bai, Z.M. Niu et al., -decay half-lives studied using neural network method (in Chinese). Sci. Sin.-Phys. Mech. Astron. 52, 
320 252006 (2022). 

321 [37] N.J. Costiris, E. Mavrommatis, K.A. Gernoth et al., Decoding 6-decay systematics: a global statistical model for 8- half-lives. Phys. 
322 Rev. C 80, 044332 (2009). 

323 [38] R. Wang, Y.G. Ma, R. Wada et al., Nuclear liquid-gas phase transition with machine learning. Phys. Research 2, 043202 (2020). 

324 [39] D. Peng, H.L. Wei, J. Pu et al., Bayesian neural network prediction methods for fragment cross sections in proton-induced spallation 


325 reactions (in Chinese). Sci. Sin.-Phys. Mech. Astron. 52, 252012 (2022). 


15 


326 [40] B.C. Wang, M.T. Qiu, W. Chen et al., Machine learning-based analyses for total ionizing dose effects in bipolar junction transistors. 
327 Nucl. Sci. Tech. 33, 131 (2022). 

328 [41] Y.B. Yu, G.F. Liu, W. Xu et al., Research on tune feedback of the Hefei Light Source II based on machine learning. Nucl. Sci. Tech. 33, 
329 28 (2022). 

330 [42] Y.D. Song, R. Wang, Y.G. Ma et al., Determining the temperature in heavy-ion collisions with multiplicity distribution. Phys. Lett. B 
331 814, 136084 (2021). 

332 [43] Q.F. Song, L. Zhu, J. Su, Target dependence of isotopic cross sections in the spallation reactions ?SU + p , d and °Be at 1 AGeV. 
333 Chinese Phys. C 46, 074108 (2022). 

334 [44] H.K. Wu, Y.J. Wang, Y.M. Wang et al., Machine learning method for '? C event classification and reconstruction in the active target 
335 time-projection chamber. Nucl. Instrum. Meth. A (2023). 

336 [45] FP. Li, Y.J. Wang, Z.P. Gao et al., Application of machine learning in the determination of impact parameter in the +3?Sn + !°*Sn 
337 system. Phys. Rev. C 104, 034608 (2021). 

338 [46] Z.Y. Li, Z. Qian, J.H. He et al., Improvement of machine learning-based vertex reconstruction for large liquid scintillator detectors with 
339 multiple types of PMTs. Nucl. Sci. Tech. 33, 93 (2022). 

340 [47] H. Arahmane, EM. Hamzaoui, Y.B. Maissa et al., Neutron-gamma discrimination method based on blind source separation and machine 
341 learning. Nucl. Sci. Tech. 32, 18 (2021). 

342 [48] J. Collado, J.N. Howard, T. Faucett et al., Learning to identify electrons. Phys. Rev. D 103, 116028 (2021). 

343 [49] L. de Oliveira, B. Nachman, M. Paganini, Electromagnetic showers beyond shower shapes. Nucl. Instrum. Meth. A 951, 162879 (2020). 
344 [50] P. Baldi, K. Bauer, C. Eng et al., Jet substructure classification in high-energy physics with deep neural networks. Phys. Rev. D 93, 
345 094034 (2016). 

346 [51] C. Fanelli, J. Pomponi, DeepRICH: learning deeply Cherenkov detectors. Mach. Learn.: Sci. Technol. 1, 015010 (2020). 

347 [52] E. Cisbani, A. Del Dotto, C. Fanelli et al., Al-optimized detector design for the future Electron-Ion Collider: the dual-radiator RICH 
348 case. J. INSTRUM. 15, P05009 (2020). 

349 [53] S. Carboni, S. Barlini, L. Bardelli et al., Particle identification using the AE-E technique and pulse shape discrimination with the silicon 
350 detectors of the FAZIA project. Nucl. Instrum. Meth. A 664, 251-263 (2012). 

351 [54] W. Klempt, Review of particle identification by time of flight techniques. Nucl. Instrum. Meth. A 433, 542-553 (1999). 

352 [55] Y.G. Ma, Effects of a-clustering structure on nuclear reaction and relativistic heavy-ion collisions. Nuclear Techniques 46, 080001 
353 (2023). 

354 [56] J.J. He, W.B. He, Y.G. Ma et al., Machine-learning-based identification for initial clustering structure in relativistic heavy-ion collisions. 
355 Phys. Rev. C 104, 044902 (2021). 

366 [57] Y.G. Ma, S. Zhang, Influence of nuclear structure in relativistic heavy-ion collisions. Handbook of Nuclear Physics 1-30 (2022). 

357 [58] X.G. Cao, E.J. Kim, K. Schmidt et al., Examination of evidence for resonances at high excitation energy in the 7 a disassembly of ?°Si. 
358 Phys. Rev. C 99, 014606 (2019). 

aso [59] X.G. Cao, E.J. Kim, K. Schmidt et al., œ and a conjugate fragment decay from the disassembly of 7°Si at very high excitation energy. 
360 JPS Conf. Proc. 010038 (2020). 

361 [60] X.G. Cao, E.J. Kim, K. Schmidt et al., Evidence for resonances in the 7 a disassembly of 8Si. AIP Conf. Proc. 2038, 020021 (2018). 
sez [61] P. Adamson, M. Youngs, Machine learning: potential application for particle identification. 2019 Fall Meeting of the APS Division of 
363 Nuclear Physics (2019). 

364 [62] S. Wuenschel, K. Hagel, R. Wada et al., NIMROD-ISiS, a versatile tool for studying the isotopic degree of freedom in heavy ion 
365 collisions. Nucl. Instrum. Meth. A 604, 578-583 (2009). 

366 [63] R. Wada, S. Wuenschel, K. Hagel et al., A 47 detector array, NIMROD-ISIS. Nucl. Phys. News 24, 28-33 (2014). 

367 [64] S. Agostinelli, J. Allison, K. Amako et al., GEANT4—a simulation toolkit. Nucl. Instrum. Meth. A 506, 250-303 (2003). 


16 


368 [65] L. Breiman, Random forests. Mach. Learn. 45, 5-32 (2001). 

369 [66] T.Q. Chen, C. Guestrin, XGBoost: a scalable tree boosting system. Proceedings of the 22nd ACM Sigkdd International Conference on 
370 Knowledge Discovery and Data Mining 785-794 (2016). 

371 [67] G.L. Ke, Q. Meng, T. Finley et al., LightGBM: a highly efficient gradient boosting decision tree. Adv. Neur. In. 30, (2017). 

372 [68] A.V. Dorogush, V. Ershov, A. Gulin, CatBoost: gradient boosting with categorical features support. (2018). 

373 [69] S.Ö. Arik, T. Pfister, Tabnet: attentive interpretable tabular learning. In Proceedings of the AAAI Conference on Artificial Intelligence 
374 35, 6679-6687 (2021). 

375 [70] J.Z. Yan, T.Y. Xu, Y.C. Yu et al., Rainfall forecast model based on the tabnet model. Water 13, 1272 (2021). 

376 [71] R. Asencios, C. Asencios, E. Ramos, Profit scoring for credit unions using the multilayer perceptron, XGBoost and TabNet algorithms: 
377 evidence from Peru. Expert Syst. Appl. 213, 119201 (2023). 

378 [72] B. Juba, H.S. Le, Precision-recall versus accuracy and the role of large data sets. Proceedings of the AAAI Conference on Artificial 
379 Intelligence 33, 4039-4048 (2019). 

380 [73] N. Japkowicz, Assessment metrics for imbalanced learning. Imbalanced Learning: Foundations, Algorithms, and Applications 187-206 
381 (2013). 

382 [74] M. Grandini, E. Bagli, G. Visani, Metrics for multi-class classification: an overview. (2020). 

383 [75] M. Hossin, M.N. Sulaiman, A review on evaluation metrics for data classification evaluations. IJDKP. 5, 1 (2015). 

384 [76] E. Mortaz, Imbalance accuracy metric for model selection in multi-class imbalance classification problems. Knowl.-Based Syst. 210, 
385 106490 (2020). 

386 [77] A, Matta, P. Morfouace, N. de Séréville et al., NPTool: a simulation and analysis framework for low-energy nuclear physics experiments. 
387 J. Phys. G. Nucl. Partic. 43, 045113 (2016). 

388 [78] I. Goodfellow, J. Pouget-Abadie, M. Mirza et al., Generative adversarial networks. Commun. ACM 63, 139-144 (2020). 

389 [79] D.P. Kingma, M. Welling, Auto-encoding variational bayes. (2013). 

300 [80] D. Derkach, N. Kazeev, F. Ratnikov et al., Cherenkov detectors fast simulation using neural networks. Nucl. Instrum. Meth. A 952, 
391 161804 (2020). 

392 [81] M. Paganini, L. de Oliveira, B. Nachman, Accelerating science with generative adversarial networks: an application to 3D particle 
393 showers in multilayer calorimeters. Phys. Rev. Lett. 120, 042003 (2018). 

304 [82] D. Salamani, S. Gadatsch, T. Golling et al., Deep generative models for fast shower simulation in ATLAS. IEEE 14th International 
395 Conference on E-Science (e-Science) 348 (2018). 

3% [83] D. Belayneh, F. Carminati, A. Farbin et al., Calorimetry with deep learning: particle simulation and reconstruction for collider physics. 
397 Eur. Phys. J. C 80, 1-31 (2020). 

338 [84] G.L. Ke, Z.H. Xu, J. Zhang et al., DeepGBM: a deep learning framework distilled by GBDT for online prediction tasks. Proceedings of 
399 the 22nd ACM Sigkdd International Conference on Knowledge Discovery and Data Mining 384-394 (2019). 

40 [85] S. Badirli, X.Q. Liu, Z.M. Xing et al., Gradient boosting neural networks: Grownet. (2020). 

41 [86] W.B. He, X.G. Cao, Y.G. Ma et al., Application of EQMD model to researches of nuclear exotic structures. Nuclear Techniques 37 
402 (2014). 

43 [87] X.G. Cao, Y.G. Ma, Progress of theoretical and experimental studies on a cluster structures in light nuclei. Chinese. Sci. Bull. 60, 
404 1557-1564 (2015). 

45 [88] W.B. He, Y.G. Ma, X.G. Cao et al., Dipole oscillation modes in light a-clustering nuclei. Phys. Rev. C 94, 014301 (2016). 

406 [89] W.B. He, Y.G. Ma, X.G. Cao et al., Giant dipole resonance as a fingerprint of a clustering configurations in '*C and ôO. Phys. Rev. 
407 Lett. 113, 032506 (2014). 


